Picture for Siheng Chen

Siheng Chen

School of Artificial Intelligence, Shanghai Jiao Tong University

EmboCoach-Bench: Benchmarking AI Agents on Developing Embodied Robots

Add code
Jan 29, 2026
Viaarxiv icon

$G^2$-Reader: Dual Evolving Graphs for Multimodal Document QA

Add code
Jan 29, 2026
Viaarxiv icon

AgentIF-OneDay: A Task-level Instruction-Following Benchmark for General AI Agents in Daily Scenarios

Add code
Jan 28, 2026
Viaarxiv icon

Toward Efficient Agents: Memory, Tool learning, and Planning

Add code
Jan 20, 2026
Viaarxiv icon

Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering

Add code
Jan 15, 2026
Viaarxiv icon

Deploy-Master: Automating the Deployment of 50,000+ Agent-Ready Scientific Tools in One Day

Add code
Jan 07, 2026
Viaarxiv icon

Bohrium + SciMaster: Building the Infrastructure and Ecosystem for Agentic Science at Scale

Add code
Dec 23, 2025
Figure 1 for Bohrium + SciMaster: Building the Infrastructure and Ecosystem for Agentic Science at Scale
Figure 2 for Bohrium + SciMaster: Building the Infrastructure and Ecosystem for Agentic Science at Scale
Figure 3 for Bohrium + SciMaster: Building the Infrastructure and Ecosystem for Agentic Science at Scale
Figure 4 for Bohrium + SciMaster: Building the Infrastructure and Ecosystem for Agentic Science at Scale
Viaarxiv icon

PhysMaster: Building an Autonomous AI Physicist for Theoretical and Computational Physics Research

Add code
Dec 22, 2025
Viaarxiv icon

Unveiling the Impact of Data and Model Scaling on High-Level Control for Humanoid Robots

Add code
Nov 12, 2025
Figure 1 for Unveiling the Impact of Data and Model Scaling on High-Level Control for Humanoid Robots
Figure 2 for Unveiling the Impact of Data and Model Scaling on High-Level Control for Humanoid Robots
Figure 3 for Unveiling the Impact of Data and Model Scaling on High-Level Control for Humanoid Robots
Figure 4 for Unveiling the Impact of Data and Model Scaling on High-Level Control for Humanoid Robots
Viaarxiv icon

InfoMosaic-Bench: Evaluating Multi-Source Information Seeking in Tool-Augmented Agents

Add code
Oct 02, 2025
Viaarxiv icon